We derive theorems which outline explicit mechanisms by which anomalous scaling for the probability
density function of the sum of many correlated random variables asymptotically prevails. The
results characterize general anomalous scaling forms, justify their universal character, and specify
universality domains in the spaces of joint probability density functions of the summand variables.
These density functions are assumed to be invariant under arbitrary permutations of their arguments.
Examples from the theory of critical phenomena are discussed. The novel notion of stability
implied by the limit theorems also allows us to define sequences of random variables whose sum 
satisfies anomalous scaling for any finite number of summands. If regarded as developing in time, 
the stochastic processes described by these variables are non-Markovian generalizations of Gaussian 
processes with uncorrelated increments, and provide, e.g., explicit realizations of a recently proposed 
model of index evolution in finance. 
I. INTRODUCTION



A major achievement of the theory of probability is the set of limit theorems [1, 2], which provide the basis to explain
statistical regularities observed in large classes of natural, economic and social mass-scale phenomena. These
theorems describe the mechanisms leading to universal forms of scaling for the probability density functions (PDF's)
of sums of many independent random variables. The scaling can be normal or anomalous, depending on whether or not the
PDF's of the individual variables possess a finite second moment. However, independence is not guaranteed in
general, and a large number of collective phenomena in Nature exhibit anomalous scaling
as a consequence of correlations. In such cases, if the PDF of the sum of the elementary variables and
its argument are simultaneously rescaled by a power D of the number of summands, it asymptotically converges to
a scaling function g which is not necessarily Gaussian or Lévy, and the scaling exponent D is in general not equal
to 1/2. Thus, an open challenge remains that of establishing limit theorems able to justify the existence and the
universality of the anomalous scaling forms occurring in the case of strongly correlated variables.

The renormalization group approach to critical phenomena in statistical physics [3] has led to developments in
probability theory which point towards a solution of this problem. Indeed, the fixed-point condition for block-spin
transformations can be regarded [10, 11, 12] as a substitute for the stability condition at the basis of the limit theorems
for the independent case. For instance, in the context of hierarchical equilibrium spin models the fixed points
of these block-spin transformations are expected to attract whole domains of strongly correlated critical systems,
displaying asymptotically the same universal form of anomalous scaling [10, 11, 12]. However, unlike in the case
of the limit theorems for independent variables, classes of admissible universal scaling forms and their universality
domains are not easily identified.

Since the standard limit theorems hold by virtue of the multiplicative structure of the joint PDF's of independent
variables, an attempt has recently been made by the present authors [16] to establish theorems on the basis of a
generalization of the multiplication operation, leading to dependent joint probability densities. Yet, due to mathematical
difficulties, the problem of constructing consistent joint PDF's for correlated variables whose sum asymptotically
satisfies scaling was not addressed [16].

Correlated random variables often considered in probability theory are those in exchangeable sequences [17]. The
joint PDF's of an arbitrary number of variables in an exchangeable sequence have the property of being invariant under
permutations of their arguments. Exchangeability was introduced by de Finetti [18], and is of paramount importance
in the Bayesian approach to probability and statistics [17]. It is already known that, thanks to the simplifying feature
of exchangeability, central limit theorems can be established [19]. The scalings foreseen by these theorems for the PDF
of the sum of the random variables involve scaling functions which are convex combinations (mixtures) of Gaussians.
For D only two values could be considered. If the variables are linearly uncorrelated, i.e., correlations are nonzero
only for nonlinear functions of the variables, the scaling exponent is the ordinary D = 1/2 [19]. Alternatively, if the
variables are also correlated at the linear level, limit theorems have been proved for D = 1 [20].

Inspired by ideas from the modern theory of critical phenomena, in the present Article we establish limit theorems 
for sums of N dependent random variables whose joint PDF's, upon increasing N, do not define sequences of random 
variables, in general. With those defining exchangeable variable sequences, our joint PDF's only share the property 
of being invariant under permutations of their arguments. To illustrate how PDF's with such properties can arise in 
physics, we discuss the example of a permutationally invariant description of a magnetic system. The novel theorems 
apply to anomalous scalings with general exponent D. They also enable the explicit construction of universality 
domains, i.e. of whole classes of sequences of joint PDF's sharing asymptotically the same scaling form for the sum 
of the variables. 

The limit theorems proved here have implications also outside the context of variables with permutationally invariant
joint PDF's. Indeed, they were inspired by a recent proposal for the description of the time evolution of financial
indexes as stochastic processes [21, 22]. When dealing with such processes, one often considers time series in which
each term represents the increment of an additive collective variable in an elementary time interval. Examples are
the displacement in diffusion, or the logarithmic return of a financial asset. In these cases, causality imposes that
the successive increments must constitute a sequence of random variables, in which the statistical properties of each
variable are independent of the successive ones. When the increments are correlated and the processes have the
property of self-similarity, i.e. when the collective variable distribution obeys scaling not just asymptotically, but for
any finite number of summands, there are some requirements that must be imposed on the joint PDF's
of the successive increments. A heuristic way of satisfying these requirements was recently proposed as a basis for
a stochastic model of the dynamics of financial indexes [21, 22]. As we show in this work, the heuristic proposal in
[21, 22] is fully justified on the basis of the novel notion of stability implied by our theorems.

In general our stochastic processes are non-stationary and the scaling has a time-inhomogeneous nature [23]. When
they become stationary, their increments also constitute sequences of exchangeable random variables. In such cases
it is not possible to reproduce the statistics of these variables by empirical time-averages along infinitely long, single
realizations of the processes. This is due to a mechanism of ergodicity breaking implied by de Finetti's representation
theorem [17]. A way out of this difficulty is found when considering self-similarity as a property of the process valid
within a limited, although possibly large, range of time-scales. This attitude is fully legitimate in many applications
[23]. We show here, by a dynamical simulation strategy of wide use in finance [25], how ergodicity can be restored in
the process, by requiring scale-invariance to hold only up to a finite upper cutoff in time.

This Article is organized as follows. In the next three Sections, we introduce the formalism and present our main
results about the limit theorems. We enunciate these theorems and give full details of their derivations in the Appendix.
After stressing the applicability of our approach to the forms of anomalous scaling emerging, e.g., in the context of
critical phenomena, in Sections V and VI we discuss implications of our results for the theory of stochastic processes.
In particular, we present a class of non-Markovian self-similar processes possessing the requisites recently postulated
[21, 22] for the case of finance and allowing explicit analytical calculations and efficient simulation strategies. The
last Section is devoted to conclusions.



II. ANOMALOUS SCALING 



Let us consider, for any given $N = 1, 2, 3, \ldots$, a set of random variables $X_i$, with $i = 1, 2, \ldots, N$, taking values
$x_i$ on the real axis. We call $p_N(x_1, \ldots, x_N)$ the joint PDF of the $N$-th set of variables and, to start with, assume that
for any $N$ this function is invariant under arbitrary permutations of its arguments. It should be stressed that, e.g.,
the random variable $X_1$ belonging to a set with $N$ variables and the $X_1$ belonging to another set with $N' \neq N$
variables are not identical, in general. Thus, in principle we should denote the variables in the $N$-th set by $X_i^{(N)}$,
$i = 1, 2, \ldots, N$, and their values by $x_i^{(N)}$. However, in order to keep formulas simple, we will not adopt this notation.
Ultimately the identity of each variable $X_i$ will be specified by the joint PDF $p_N(x_1, x_2, \ldots, x_i, \ldots, x_N)$ used in
order to evaluate its statistical properties. In this way, our formulas will conform to the standards of the statistical
mechanics literature [10, 11, 12]. To further simplify the formalism we can require, without loss of generality, that for
any $N$ all the variables have zero average, $\langle X_i \rangle_{p_N} = 0$ $\forall i$, where
$\langle (\cdot) \rangle_{p_N} \equiv \int dx_1 \cdots dx_N \, (\cdot) \, p_N(x_1, \ldots, x_N)$. For the
sum $Y_N \equiv X_1 + \cdots + X_N$, whose PDF is

$$p_{Y_N}(y) = \int dx_1 \cdots dx_N \, \delta(y - x_1 - \ldots - x_N) \, p_N(x_1, \ldots, x_N), \qquad (1)$$
this also implies $\langle Y_N \rangle_{p_{Y_N}} = 0$. We are interested in cases in which the sequence $p_N(x_1, \ldots, x_N)$, $N = 1, 2, \ldots$, is such
that $p_{Y_N}$ satisfies anomalous scaling for $N \to \infty$, i.e.

$$N^D \, p_{Y_N}(N^D y) \to g(y), \qquad (2)$$

where $g$ is a scaling function, and $D$ is a scaling dimension. We want to identify whole domains of $p_N$'s such that
$p_{Y_N}$ satisfies Eq. (2) with a given $g$ and a given $D$. Besides the kind of convergence, the class of admissible $g$'s
and the range of $D$'s need to be specified. As we discuss below, examples of $p_{Y_N}$'s such that Eq. (2) holds are easily
found in statistical physics.

We first clarify why the exponent values $D = 1/2$ and $D = 1$ naturally arise for sequences of exchangeable variables.
Let us suppose that $\langle Y_N^2 \rangle_{p_N}$ is finite for any $N$. Since permutational invariance implies
$\langle X_i^2 \rangle_{p_N} = \langle X_1^2 \rangle_{p_N}$ $\forall i$, and
$\langle X_i X_j \rangle_{p_N} = \langle X_1 X_2 \rangle_{p_N}$ $\forall i \neq j$, one has

$$\langle Y_N^2 \rangle_{p_N} = N \langle X_1^2 \rangle_{p_N} + N(N-1) \langle X_1 X_2 \rangle_{p_N}. \qquad (3)$$

On the other hand, if, as appropriate for sequences of random variables, the sequence of joint PDF's $p_N$ is constructed
consistently with the condition

$$p_{N-1}(x_1, \ldots, x_{N-1}) = \int dx_N \, p_N(x_1, \ldots, x_N), \qquad (4)$$

where $N \geq 2$, it is clear that $\langle X_1^2 \rangle_{p_N}$ and $\langle X_1 X_2 \rangle_{p_N}$ do not depend on $N$. Since according to the scaling condition in
Eq. (2) $\langle Y_N^2 \rangle_{p_N} \sim N^{2D}$, Eq. (3) implies that either $D = 1/2$ and $\langle X_1 X_2 \rangle_{p_N} = 0$, or $D = 1$ and $\langle X_1 X_2 \rangle_{p_N} > 0$. In
the former case, further restrictions on the averages of products of $X$'s apply if higher moments of $Y_N$ are assumed
to exist. We should stress that if Eq. (4) is satisfied by the sequence of permutation-invariant joint PDF's, then these
PDF's in turn define a sequence of exchangeable variables. Indeed, Eq. (4) guarantees that a given variable, say $X_i$,
is strictly the same random variable, independent of the set of $N$ variables within which it is considered.
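
The counting argument based on Eq. (3) can be checked with a short numerical sketch. The Python snippet below is only an illustration and not part of the derivations in the Appendix; the function names and the numerical values chosen for the variance and the covariance are assumptions introduced here. It evaluates the right-hand side of Eq. (3) for moments that do not depend on N and estimates D from the growth of the second moment of the sum between N and 2N.

```python
import math


def second_moment_of_sum(n, var, cov):
    """Right-hand side of Eq. (3): N <X_1^2> + N (N - 1) <X_1 X_2>."""
    return n * var + n * (n - 1) * cov


def effective_exponent(n, var, cov):
    """Estimate D from <Y_N^2> ~ N^(2D) by comparing N with 2N."""
    ratio = second_moment_of_sum(2 * n, var, cov) / second_moment_of_sum(n, var, cov)
    return math.log(ratio) / (2.0 * math.log(2.0))


if __name__ == "__main__":
    n = 10**6
    # Linearly uncorrelated variables: the estimate is 1/2.
    print(effective_exponent(n, var=1.0, cov=0.0))
    # Positive linear correlation: the estimate approaches 1 for large N.
    print(effective_exponent(n, var=1.0, cov=0.1))
```

With N-independent moments no other exponent can appear, which is why general values of D require joint PDF's that do not satisfy Eq. (4).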

As discussed in Section IV, there are cases, for example in statistical mechanics, where one considers a system
in equilibrium at a given temperature, so that $p_N$ represents the canonical joint PDF of $N$ variables describing the
degrees of freedom of the system. Since $p_N$ is expressed as a ratio between the Gibbsian weight and the partition sum,
upon integrating $p_N$ over one of the $N$ variables, as a rule we do not obtain the joint PDF of a system in equilibrium
at the same temperature and with just $N - 1$ variables. Indeed, tracing over one of the variables leads to effective
interactions which are not present in the Hamiltonian for $N - 1$ variables. The modern theory of critical phenomena
shows that the renormalization effects determining this difference lead to anomalous scaling at the critical point [3].
This circumstance, which is expected to occur in many cooperative phenomena, will allow us to derive limit theorems
for sums of exchangeable variables with general values of $D$.

On the other hand, in problems where $N$ represents the number of increments over successive time intervals of a
stochastic process and $p_{N-1}$ and $p_N$ are respectively the joint PDF's of the first $N - 1$ and $N$ increments, causality
requires us to consider sequences of $p_N$'s satisfying Eq. (4). Below we will also show how the stability conditions implied
by our limit theorems allow us to define sequences of random variables whose joint PDF's satisfy Eq. (4) and whose
aggregated increment $Y_N$ satisfies anomalous scaling exactly for any $N$.



III. ILLUSTRATION OF THE MAIN RESULTS 



We report our main statements and their mathematical proofs in the Appendix. Here we rather choose to illustrate
the meaning and some implications of our results. Let us first consider $p_N$ of the form

$$p_N(x_1, \ldots, x_N) = \int_{-\infty}^{+\infty} d\mu \, \lambda(\mu) \prod_{i=1}^{N} l\!\left(x_i - \frac{\mu}{N^{1-D}}\right), \qquad (5)$$

where $\lambda$ and $l$ are single-variable PDF's. With no loss of generality we require $\langle \mu \rangle_\lambda = 0$, whereas we assume $\langle X \rangle_l = 0$
and $\langle X^2 \rangle_l = 1$. The higher integer moments of $l$ are left arbitrary. Clearly, the $p_N$ in Eq. (5) is a positive density
normalized to 1, and invariant under permutations of its arguments. The $X_i$'s are dependent, since $p_N$ does not simply
factorize into a product of single-variable PDF's. The choice of considering $p_N$'s which are convex combinations of
products of single-variable PDF's is motivated by the fact that in this way it is possible to demonstrate the existence
of asymptotic scalings with very general scaling functions. Indeed, in the Appendix we show that with the joint PDF
in Eq. (5), $p_{Y_N}$ satisfies Eq. (2) with a scaling exponent $D \geq 1/2$. The scaling function $g$ is determined by $\lambda$. For
$D = 1/2$, $g$ is given by

$$g(x) = \int_{-\infty}^{+\infty} d\mu \, \lambda(\mu) \, \frac{e^{-(x-\mu)^2/2}}{\sqrt{2\pi}}, \qquad (6)$$

whereas $g$ coincides with $\lambda$ itself if $D > 1/2$. In both cases, upon varying $\lambda$ the scaling function $g$ assumes general
shapes. For instance, it may have several local and global maxima and power-law decays to zero at large positive $x$
and/or $-x$, as required in many applications.
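
A Monte Carlo sketch may help to visualize the statement that, for D > 1/2, the scaling function inherits the shape of the mixing density in Eq. (5). The Python code below is an illustration under assumed choices that do not come from the text: the mixing density puts weight 1/2 on each of the two values -1 and +1, the single-variable PDF is uniform with zero mean and unit variance, and the function names are hypothetical. Each sample first draws the shift, then N conditionally independent variables, and finally rescales their sum by N^D.

```python
import math
import random


def sample_rescaled_sum(n, d, rng):
    """Draw one value of Y_N / N^D for the joint PDF of Eq. (5).

    Assumed for illustration: mu = -1 or +1 with equal weight, and l uniform
    on [-sqrt(3), sqrt(3)] (zero mean, unit variance).
    """
    mu = rng.choice((-1.0, 1.0))
    shift = mu / n ** (1.0 - d)  # mu enters divided by N^(1-D)
    half_width = math.sqrt(3.0)
    total = sum(shift + rng.uniform(-half_width, half_width) for _ in range(n))
    return total / n ** d


def second_moment(samples):
    return sum(y * y for y in samples) / len(samples)


if __name__ == "__main__":
    rng = random.Random(0)
    n, d = 400, 0.8
    ys = [sample_rescaled_sum(n, d, rng) for _ in range(2000)]
    # Expected second moment: <mu^2> + N^(1-2D), which tends to <mu^2> = 1.
    print(second_moment(ys), 1.0 + n ** (1.0 - 2.0 * d))
    # Fraction of samples in the positive peak inherited from the mixing density.
    print(sum(y > 0 for y in ys) / len(ys))
```

Given the shift, the rescaled sum fluctuates around it with variance N^(1-2D), so for D > 1/2 the two peaks sharpen as N grows, while for D = 1/2 they keep unit width, in line with Eq. (6).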

As anticipated above, when the variables $X_i$'s are dependent and do not constitute a sequence, it is legitimate to
introduce in the definition of $p_N$ the $N$-dependence arising from the fact that $\mu$ enters divided by $N^{1-D}$. In particular,
precisely this dependence implies that the joint PDF of a system with $N - 1$ variables, $p_{N-1}$, rather than satisfying
Eq. (4), is linked to $p_N$ by the relation:

N _ ]\ (N-l)(l-D) „ f/N- 1 X {1 ~ D) 




p N -i(xi, . . .,x N -i) = y — j J dx N PN N j Xx,-.. 

Consistently with the fact that the $X_i$'s do not constitute a sequence of random variables, the marginal PDF of
each individual $X_i$,

$$p_{X_i,N}(x_i) = \int dx_1 \cdots dx_{i-1} \, dx_{i+1} \cdots dx_N \, p_N(x_1, \ldots, x_N), \qquad (8)$$

clearly depends on $N$ ($N \geq i$). So, if the second moment of $\lambda$ is finite, one realizes that $p_{X_i,N}$ has a finite width for
$N \to \infty$ when $1/2 < D < 1$. If $D > 1$, this width diverges in the large-$N$ limit. Such a divergence makes full sense in
a correlated context. Indeed, in relation to the anomalous character of the scaling, the marginal single-variable PDF's
play here a role analogous to that of single-variable PDF's in the independent case. For example, with independent
variables one allows the single-variable PDF's to be of infinite width for any $N$, in order to have an anomalous, Lévy
scaling limit of $p_{Y_N}$. Here, with correlated variables, the dependence on $N$ entering in $p_{X_i,N}$ and the consequent
divergence of width for $N \to \infty$ and $D > 1$ play a qualitatively similar role in producing anomalous scaling.

It is natural to ask what are the correlations of the variables $X_i$'s according to the joint PDF's defined in Eq. (5).
If $\langle \mu^2 \rangle_\lambda$ exists, an easy calculation gives for example

$$\langle X_i X_j \rangle_{p_N} = \frac{\langle \mu^2 \rangle_\lambda}{N^{2(1-D)}} \qquad (9)$$

for $i \neq j$. In particular, the variables with permutation-invariant joint PDF's as in Eq. (5) are linearly correlated for
finite $N$. When $1/2 < D < 1$ their linear correlators approach zero only asymptotically.
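
The correlator in Eq. (9) can be compared with a direct sampling estimate. The sketch below is a hedged illustration: the two-point mixing density (values -1 and +1 with equal weight), the standard Gaussian single-variable PDF, and the function names are assumptions made here, not choices taken from the text. Since the variables are conditionally independent once the shift is drawn, only two of the N variables need to be generated.

```python
import random


def estimate_pair_correlator(n, d, n_samples, rng):
    """Monte Carlo estimate of <X_1 X_2> for Eq. (5) under the assumed densities."""
    shift_scale = n ** (1.0 - d)
    acc = 0.0
    for _ in range(n_samples):
        mu = rng.choice((-1.0, 1.0))
        # Given mu, X_1 and X_2 are independent draws from l shifted by mu / N^(1-D);
        # the remaining N - 2 variables are integrated out and need not be drawn.
        x1 = mu / shift_scale + rng.gauss(0.0, 1.0)
        x2 = mu / shift_scale + rng.gauss(0.0, 1.0)
        acc += x1 * x2
    return acc / n_samples


def predicted_pair_correlator(n, d, mu_second_moment=1.0):
    """Eq. (9): <X_i X_j> = <mu^2> / N^(2 (1 - D))."""
    return mu_second_moment / n ** (2.0 * (1.0 - d))


if __name__ == "__main__":
    rng = random.Random(0)
    print(estimate_pair_correlator(16, 0.75, 40000, rng))
    print(predicted_pair_correlator(16, 0.75))
```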

Next, we consider more general scaling functions which can be expressed as convex combinations of Gaussians with
varying centers $\mu$ and widths $\sigma$. The form is

$$g(x) = \int_0^{+\infty} d\sigma \int_{-\infty}^{+\infty} d\mu \, \psi(\sigma, \mu) \, \frac{\exp\left[-(x-\mu)^2/2\sigma^2\right]}{\sqrt{2\pi\sigma^2}}, \qquad (10)$$

where $\sigma \in (0, \infty)$, and $\psi$ is a PDF. The scaling exponent can now be any $D > 0$. Again, for the sake of simplicity we
require $\langle \mu \rangle_\psi = 0$, while $\psi$ must be strictly equal to zero in a whole neighborhood of $\sigma = 0$, for any $\mu$. In the Appendix
we prove that with the $p_N$'s constructed as follows:

$$p_N(x_1, \ldots, x_N) = \int_0^{+\infty} d\sigma \int_{-\infty}^{+\infty} d\mu \, \psi(\sigma, \mu) \prod_{i=1}^{N} \frac{1}{\sigma N^{D-1/2}} \, l\!\left(\frac{x_i}{\sigma N^{D-1/2}} - \frac{\mu}{\sigma N^{1/2}}\right), \qquad (11)$$

with $\langle X \rangle_l = 0$ and $\langle X^2 \rangle_l = 1$ as before, $p_{Y_N}$ satisfies the asymptotic scaling (2) with $g$ given by Eq. (10) and the
chosen $D > 0$. One easily verifies that if $\psi(\sigma, \mu) = \rho(\sigma)\,\delta(\mu)$ the $X$ variables are linearly uncorrelated for any $N$. In
this case, with $D = 1/2$ the scaling limit of our theorem recovers known results valid for sums of random variables in
exchangeable sequences [19].

It should also be noticed that if we put $\psi(\sigma, \mu) = \delta(\sigma - 1)\,\lambda(\mu)$ and $D = 1/2$, one recovers the case discussed at the
beginning of this section.
IV. PERMUTATION-INVARIANT JOINT PDF'S AND CRITICAL PHENOMENA

All the cases discussed in the previous sections concern correlated variables whose joint PDF's for any $N$ are
permutationally invariant. At first sight, such a feature may appear too restrictive a condition to be satisfied by realistic
models, and applications may often require relaxing it. However, in the study of anomalous scaling, variables of this
kind may still play an important role. To illustrate this point, we consider the example of an Ising-like spin model,
of the type often studied in the renormalization group approach to critical phenomena. Let us consider a system
of $N$ spins $S_i$, $i = 1, \ldots, N$, where the index $i$ labels the sites of a finite box of a square or cubic lattice. The spins are
supposed to take values $s_i$ on the real axis. Equilibrium statistical mechanics allows in principle to construct the joint
PDF of the $N$ spin variables once the spin Hamiltonian $H(\{s\})$ and the temperature $T$ are given. Since the spin variables
are associated with the lattice sites, their joint PDF is not invariant under permutations. Indeed, for any configuration
$\{s_1, s_2, \ldots, s_N\}$, one has in general $H(s_{\pi(1)}, \ldots, s_{\pi(N)}) \neq H(s_1, \ldots, s_N)$, if $\pi$ is a permutation of the $N$ labels. This
inequality holds because $H$ is a sum of local interactions. Thus, also the canonical joint PDF

$$p'_N(s_1, \ldots, s_N) = \frac{\exp\left[-H(s_1, \ldots, s_N)/k_B T\right]}{\int \prod_{i=1}^{N} ds'_i \, \exp\left[-H(\{s'\})/k_B T\right]}, \qquad (12)$$

where $k_B$ is the Boltzmann constant, is not invariant under permutations of its arguments. On the other hand, when,
e.g., discussing the critical behavior of the model, a key collective random quantity to be considered is the sum of
all the spins, $\sum_{i=1}^{N} S_i$, which, in contrast, is invariant under any permutation of the spin labels, and
is expected to have a PDF satisfying anomalous scaling in the thermodynamic limit. This suggests
to define what we call here a "permutation invariant representation" of the statistics of the model. Consider, for
instance, the following definition of the joint PDF of new exchangeable variables $X_i$'s:



$$p_N(x_1, x_2, \ldots, x_N) = \frac{1}{N!} \sum_{\pi} \int \prod_{i=1}^{N} ds_i \, p'_N(s_1, s_2, \ldots, s_N) \prod_{j=1}^{N} \delta(x_j - s_{\pi(j)}), \qquad (13)$$

where the sum is extended to all the $N!$ permutations $\pi$ of the set $\{1, 2, \ldots, N\}$. The $p_N$'s defined by the projection
operation in Eq. (13) are indeed invariant under permutations, while their sum $Y_N = \sum_i X_i$ has a PDF identical to
that of the total magnetization $\sum_i S_i$ of the original system. On the basis of the same projection, one can also define
an effective Boltzmann factor for the variables $X_i$'s in such a way that the partition function, and thus the free energy
of the original problem, are preserved, too. Even if the computation of the effective Hamiltonian in terms of the $X_i$'s
is non-trivial, the above equations show that the asymptotic scaling of the PDF of $\sum_i S_i$ for a critical Ising-like model
and that of $\sum_i X_i$ for its permutation invariant representation coincide. It is also easy to see that one may construct
different such representations of a given statistical model, all sharing the same free energy and the same PDF for $Y_N$.
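
The projection in Eq. (13) can be made concrete on a toy system small enough to enumerate. The sketch below is an assumption-laden illustration rather than the model discussed in the text: it uses spins restricted to the values -1 and +1 on an open chain of three sites with nearest-neighbor coupling, and all function names are hypothetical. It builds the canonical probabilities, averages them over all relabelings of the sites, and compares the distribution of the total magnetization before and after the projection.

```python
import itertools
import math


def boltzmann_open_chain(n, coupling_over_kt):
    """Canonical probabilities for spins +-1 on an open chain (toy stand-in for Eq. (12))."""
    weights = {}
    for s in itertools.product((-1, 1), repeat=n):
        energy_over_kt = -coupling_over_kt * sum(s[i] * s[i + 1] for i in range(n - 1))
        weights[s] = math.exp(-energy_over_kt)
    z = sum(weights.values())
    return {s: w / z for s, w in weights.items()}


def symmetrize(p):
    """Discrete analogue of Eq. (13): average over all relabelings of the sites."""
    n = len(next(iter(p)))
    perms = list(itertools.permutations(range(n)))
    out = {}
    for s, prob in p.items():
        for perm in perms:
            x = tuple(s[perm[j]] for j in range(n))  # x_j = s_{pi(j)}
            out[x] = out.get(x, 0.0) + prob / len(perms)
    return out


def is_permutation_invariant(p, tol=1e-12):
    n = len(next(iter(p)))
    return all(
        abs(prob - p.get(tuple(x[k] for k in perm), 0.0)) <= tol
        for x, prob in p.items()
        for perm in itertools.permutations(range(n))
    )


def sum_distribution(p):
    """Distribution of the sum of the variables (total magnetization)."""
    dist = {}
    for x, prob in p.items():
        dist[sum(x)] = dist.get(sum(x), 0.0) + prob
    return dist


if __name__ == "__main__":
    p_spin = boltzmann_open_chain(3, 1.0)
    p_sym = symmetrize(p_spin)
    print(is_permutation_invariant(p_spin), is_permutation_invariant(p_sym))
    print(sum_distribution(p_spin))
    print(sum_distribution(p_sym))
```

The enumeration over all N! permutations is only feasible for very small N; it is meant to show the structure of the projection, not to be a practical algorithm.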

For a critical Ising system one expects an anomalous scaling for the PDF of $\sum_i S_i$ with scaling dimensions
$D = 15/16$ and $D \simeq 0.825$ for square and cubic lattices, respectively [3]. Taking into account that finite size
scaling for the critical Ising model implies $\langle (\sum_i S_i)^2 \rangle_{p'_N} \sim N^{2D}$, one also concludes that for the permutation invariant
representation defined by Eq. (13) one must have $\langle X_i X_j \rangle_{p_N} \sim N^{2D-2}$ for $N \to \infty$ and $i \neq j$. As a matter of fact, the
limit theorem for the joint PDF's in Eq. (5) implies a scaling function for the PDF of $Y_N$ and linear correlations for the $X_i$'s (Eq. (9))
which are compatible with the asymptotic forms expected for the permutation invariant representation of the Ising
model constructed here.

The above discussion clarifies that correlated variables with permutation-invariant PDF's can be relevant in the 
statistical approach to anomalous scaling. This relevance stems from the fact that for these variables the constructive 
limit theorems presented here are valid. At the same time, additive collective variables like the total magnetization of 
a critical Ising model are considered in many studies of complex systems, also outside equilibrium statistical mechanics
[14].



V. NON-MARKOVIAN, SELF-SIMILAR STOCHASTIC PROCESSES 

In many phenomena, anomalous scaling is a statistical symmetry obeyed to a good approximation for more or
less broad ranges of finite $N$'s. The validity of limit theorems of the kind proved in the previous sections opens the
possibility of defining joint PDF's consistent with an exact anomalous scaling of $p_{Y_N}$ for any finite $N$, i.e. such that
$p_{Y_N}(y) = N^{-D} g(N^{-D} y)$.

To illustrate how self-similarity for arbitrary finite $N$ arises, let us consider the case of the central limit theorem
for sums of independent random variables whose PDF has finite second moment. The asymptotic scaling is normal
and turns out to be an attractor by virtue of the stability property of the Gaussian PDF. In particular, this stability
implies that if we consider a finite number of independent increments, $X_1, X_2, \ldots, X_N$, each one weighted by the same
Gaussian PDF, the total increment $X_1 + X_2 + \cdots + X_N$ has also precisely a Gaussian PDF, having a width $N^{1/2}$
times the width of the individual increments. Thus, this PDF strictly satisfies normal scaling for any $N$.

In an analogous way, the results obtained in the previous sections for sums of correlated variables allow us to
construct joint PDF's of the $X$ variables consistent with an exact anomalous scaling of $p_{Y_N}$, for any finite $N$. The
generalized stability conditions implied by our limit theorems make this possible. To be concrete, let us consider the
case of the scaling function in Eq. (10). The construction of Eq. (11) implies that if we define

$$p_N(x_1, x_2, \ldots, x_N) = \int_0^{+\infty} d\sigma \int_{-\infty}^{+\infty} d\mu \, \psi(\sigma, \mu) \prod_{i=1}^{N} \frac{1}{\sqrt{2\pi\sigma^2 N^{2D-1}}} \exp\left[-\frac{1}{2}\left(\frac{x_i}{\sigma N^{D-1/2}} - \frac{\mu}{\sigma N^{1/2}}\right)^2\right], \qquad (14)$$

this joint PDF is consistent with an exact anomalous scaling of $p_{Y_N}$ with scaling function $g$ given by Eq. (10) and
exponent $D > 0$, for any finite $N$. Since at the empirical level $p_{Y_N}$ is often the most accessible PDF of the system
[21, 22], such joint PDF's constructed in terms of $g$ may be regarded as a model for the dependences determining the
anomalous scaling in the range of $N$-values relevant for the phenomenon under study.

In the following, let us deal with processes developing in (discrete) time and think of $X_i$ as an increment relative to
the time interval $[(i-1)\Delta t, i\Delta t]$, while the elapsed time of the process is $t = N\Delta t$ and $\Delta t$ is the elementary time-step
of the process. Clearly, if $p_N$ is the joint PDF of the first $N$ increments of the same process developing in time,
causality imposes the validity of Eq. (4) for any $N > 1$. The conditional PDF

$$p^c_N(x_N | x_1, x_2, \ldots, x_{N-1}) = \frac{p_N(x_1, x_2, \ldots, x_N)}{p_{N-1}(x_1, x_2, \ldots, x_{N-1})} \qquad (15)$$

($N \geq 2$) expresses the PDF of the $N$-th increment of the process, conditioned on the history of the previous $N - 1$
ones. Like the joint PDF's, the conditional PDF's together with $p_1$ embody the full information on the process. For a
causal process with non-Markovian character, a property we should be ready to give up for the $X_i$'s is the invariance
under permutations of their joint PDF's.

Referring again to an anomalous scaling with $g$ as in Eq. (10) and $D > 0$, it is not difficult to figure out how
to modify Eq. (14) in order to obtain a discrete-time stochastic process possessing self-similarity for finite $N$. To
this purpose, let us introduce the following coefficients: $a_i = [i^{2D} - (i-1)^{2D}]^{1/2}$ and $b_i = i^D - (i-1)^D$, with
$i = 1, 2, \ldots, N$. If we then define

$$p_N(x_1, x_2, \ldots, x_N) = \int_0^{+\infty} d\sigma \int_{-\infty}^{+\infty} d\mu \, \psi(\sigma, \mu) \prod_{i=1}^{N} \frac{\exp\left[-(x_i - b_i \mu)^2/2 a_i^2 \sigma^2\right]}{\sqrt{2\pi a_i^2 \sigma^2}}, \qquad (16)$$

one can verify that this joint PDF indeed guarantees for any $N$ a strict scaling for $p_{Y_N}$:

$$N^D \, p_{Y_N}(N^D y) = g(y). \qquad (17)$$

Eq. (17) holds because the coefficients $a_i$ and $b_i$ satisfy $\sum_{j=1}^{N} a_j^2 = N^{2D}$ and $\sum_{j=1}^{N} b_j = N^D$, respectively. The
condition in Eq. (4) is also respected. One recognizes immediately that for general $\psi(\sigma, \mu)$ the $p_N$'s in Eq. (16)
are not permutation invariant anymore for any $D > 0$. The lack of such invariance is also evident in the fact that
$p_{X_i} \equiv p_{X_i,N}$, $\forall N \geq i$, now varies with $i$, reflecting a nonstationarity of the increments.
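
The two sum rules that make the scaling exact are telescoping sums, which a few lines of Python can confirm numerically. The sketch below is illustrative only; the function names are hypothetical, and the sampler works at fixed width and center of the Gaussian mixture components rather than drawing them from the mixing density.

```python
import math
import random


def coefficients(n, d):
    """a_i = [i^(2D) - (i-1)^(2D)]^(1/2) and b_i = i^D - (i-1)^D for i = 1..N."""
    a = [math.sqrt(i ** (2 * d) - (i - 1) ** (2 * d)) for i in range(1, n + 1)]
    b = [i ** d - (i - 1) ** d for i in range(1, n + 1)]
    return a, b


def sample_increments(n, d, sigma, mu, rng):
    """Draw x_1..x_N from the Gaussian product in Eq. (16) at fixed (sigma, mu).

    Drawing (sigma, mu) from the mixing density beforehand would sample the full mixture.
    """
    a, b = coefficients(n, d)
    return [rng.gauss(b_i * mu, a_i * sigma) for a_i, b_i in zip(a, b)]


if __name__ == "__main__":
    n, d = 50, 0.3
    a, b = coefficients(n, d)
    print(sum(x * x for x in a), n ** (2 * d))  # sum of a_i^2 versus N^(2D)
    print(sum(b), n ** d)                       # sum of b_i versus N^D
    print(sample_increments(5, d, 1.0, 0.0, random.Random(0)))
```

At fixed width and center the sum of the increments is Gaussian with mean equal to the center times N^D and standard deviation equal to the width times N^D, which is the content of Eq. (17) before averaging over the mixture.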
When

$$\psi(\sigma, \mu) = \rho(\sigma)\,\delta(\mu) \qquad (18)$$

with $\rho(\sigma) \neq \delta(\sigma - \sigma_0)$, $Y_N$ amounts to a stochastic process of the form postulated recently for the description of the
evolution of financial indexes [21, 22]. In such a case, the increments are linearly uncorrelated and, up to an $i$-dependent rescaling,
their marginal PDF's coincide with $g$. The characteristic function of the scaling function $g$ can be expressed as
$\tilde{g}(k) = \int d\sigma \, \rho(\sigma) \exp(-\sigma^2 k^2/2)$, and has the remarkable property that it is converted into a proper $N$-dimensional
joint characteristic function if $k$ is replaced by $\sqrt{k_1^2 + k_2^2 + \cdots + k_N^2}$, for any $N$. Precisely this requirement has been
identified in Refs. [21, 22] as a natural one for the joint characteristic function of the successive returns of an index.
A theorem due to Schoenberg states that the $\tilde{g}(k)$'s having the above form exhaust the class of characteristic
functions with such property. In particular, the class of scaling functions from which one can construct explicit joint
PDF's is specified. This class includes the form used in Ref. [21] and also the Student distribution recently considered
H in 
VI. RESTORING ERGODICITY

The ergodic properties of the dynamics of stochastic processes like those obtained using Eqs. (16) and (18) need to be
analyzed in some detail. To be concrete, let us take $\psi$ as in Eq. (18), with an arbitrary $\rho$ and $D = 1/2$. In this
particular case, since the $a_i$'s are all equal, the increments constitute an exchangeable sequence and are stationary.
Hence, the problem of ergodicity is clearly posed. The form of the joint PDF's in Eq. (16) amounts to a convex
combination of uncorrelated Gaussian increments with different $\sigma$'s. Any simulation of a single, infinitely long history
$(x_1, x_2, \ldots, x_N, \ldots)$ made on the basis of the sequence $p_1(x_1) = g(x_1)$, $p^c_2(x_2|x_1)$, $\ldots$, $p^c_N(x_N|x_{N-1}, \ldots, x_1)$, $\ldots$
would not be apt to manifest the ensemble correlations implied by $p_N$ in Eq. (16). Indeed, after an initial transient,
the extraction of the successive increments would essentially be ruled by a Gaussian conditional PDF with an
approximately constant $\sigma = \bar{\sigma}$, chosen among all those allowed by $\rho$. A different simulation would pick up a different $\sigma$ in the
initial transient stage and then proceed with independent increments extracted according to this $\sigma$ (see Appendix).
The correlations implied by Eq. (16) are reproduced only by putting together the results of an ensemble of a large
number of different such simulations. A sliding time-interval sampling procedure along a single infinite history would
not detect any correlations among the increments. This amounts to a breaking of ergodicity: the single infinitely long
realization of the process just isolates one of its possible uncorrelated ergodic components, a well-known consequence
of de Finetti's representation theorem for exchangeable variable sequences [17]. This lack of ergodicity appears at first
sight to represent a serious limitation of the stochastic process if, as in finance, a legitimate ambition is to simulate
single long histories with the same correlation and scaling properties as the empirical one.
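
The breaking of ergodicity described above can be seen in a minimal simulation. The Python sketch below rests on assumptions made only for illustration: a width distribution with two allowed values, D = 1/2, centers fixed at zero, and hypothetical function names. It uses the fact that, for a mixture of products of identical Gaussians, sampling sequentially from the conditional PDF's is equivalent in law to drawing the width once and then generating independent Gaussian increments.

```python
import random


def single_history(n_steps, sigmas, weights, rng):
    """One realization of the exchangeable Gaussian-mixture process (D = 1/2, zero centers).

    The width is drawn once per history; all increments then share it.
    """
    sigma = rng.choices(sigmas, weights=weights, k=1)[0]
    return sigma, [rng.gauss(0.0, sigma) for _ in range(n_steps)]


def time_averaged_second_moment(xs):
    return sum(x * x for x in xs) / len(xs)


def ensemble_second_moment(sigmas, weights):
    total = sum(weights)
    return sum(w * s * s for s, w in zip(sigmas, weights)) / total


if __name__ == "__main__":
    rng = random.Random(0)
    sigmas, weights = [0.5, 2.0], [0.5, 0.5]
    sigma, xs = single_history(20000, sigmas, weights, rng)
    # The time average tracks the width of this history, not the ensemble value.
    print(sigma ** 2, time_averaged_second_moment(xs), ensemble_second_moment(sigmas, weights))
```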

It is possible to recover the anomalous scaling and the correlations implied by our construction of the joint PDF's
using a suitably defined dynamics. Let us go back to the motivations mentioned above for considering self-similar
processes: the approximate satisfaction of anomalous scaling for PDF's like that of the aggregated increment in a
time interval of duration $\tau$ is often valid for a limited range, $\tau < M\Delta t$. Under these premises, an adequate goal for the
simulation is that of reproducing, by time-averages along a single dynamical trajectory, the scaling and correlation
properties implied by Eq. (16) just over the time range $M\Delta t$. One way of obtaining these properties, namely
ergodicity and self-similarity up to the time-scale $M\Delta t$, is by implementing an autoregressive dynamics [25] with
memory span equal to $M$. Imagine we have extracted, consistently with the conditional PDF's $p^c_i$, $i = 1, 2, \ldots, M$, the
first $M$ increments of the additive variable $Y_M$. Instead of using the conditional PDF $p^c_{M+1}(x_{M+1}|x_M, x_{M-1}, \ldots, x_1)$
to extract the $(M+1)$-th increment, we use $p^c_M(x_{M+1}|x_M, x_{M-1}, \ldots, x_2)$. Similarly, for any time $t > M\Delta t$ we use
this autoregressive scheme in which only the preceding $M - 1$ increments have an effect on the further evolution. In
this way one circumvents the problem of broken ergodicity, because for finite $M$ the conditioning input is constantly
updated and modified to an extent which is sufficient for a long-enough simulation to span all the $\sigma$'s allowed by the
ensemble in Eq. (16). With such a strategy the empirical PDF of the sum of the increments over an interval $\tau$, sampled
from all intervals of duration $\tau$ along a single long history of the process, satisfies to a very good approximation the
anomalous scaling for $\tau < M\Delta t$ (see Appendix).

VII. CONCLUDING REMARKS AND PERSPECTIVES 

In this Article we have shown that the choice of variables with joint PDF's invariant under permutations is par- 
ticularly favorable for discussing the problem of the asymptotic emergence and universality of anomalous scaling 
due to correlations. Ideas of the modern theory of critical phenomena and complex systems are at the basis of the 
advancements we could present here. Indeed, our limit theorems cover forms of anomalous scaling which, to the best of our
knowledge, have not so far been treated in the probabilistic literature with the present generality. At the same time,
classical examples taken from the theory of critical phenomena gave us a way to illustrate the role variables with 
permutation invariant joint PDF's can play in more general problems with anomalous scaling. 

As remarked above, the idea of basing limit theorems for correlated variables on some suitable generalization of
the standard multiplication has some appeal jig. The rules by which we compose the $l$ PDF's to obtain $p_N$ in Eqs.
(5) or (jlip retain in fact the commutative and associative properties. In this respect, our approach to anomalous
scaling is quite different from the renormalization group one, and remains closer in spirit to the limit theorems for
independent variables. This closeness is also manifest in the relative simplicity of our proofs, which directly rely
on the corresponding ones for the independent case. Thus, the mathematics at the basis of the standard central
limit theorem plays a fundamental role also outside the context of independent variables. Another difference of our
approach compared to the renormalization group is that we do not need to make use of hierarchical modeling to
have analytical control on statistical coarse-graining operations. Here we replace the hierarchical paradigm by the
assumption of invariance under permutations. In principle, this replacement still allows one to address realistic scalings,
as illustrated in Section IV.

We have expressed our limit PDF's for the (rescaled) sums of correlated random variables as convex combinations
of Gaussians with varying widths and/or centers. Scaling functions belonging to this class have been considered very
often in phenomenological descriptions of anomalous scaling [29], but their possible implications as far as correlations
are concerned were not stressed enough, in our opinion. The wide classes of scaling functions and the continuous 
ranges of scaling exponents identified through our theorems, definitely do not support the idea that in the context of 
strongly correlated variables relevant scaling forms could be organized in a restricted set of universality classes. In 
particular, there does not appear to exist one or few particular scaling functions playing a universal role similar to 
the one of the Gaussian in the independent case. 

The generalization of the notion of stability implied by our theorems naturally leads to the introduction of self- 
similar stochastic processes with correlated increments. These include in particular the process proposed in Ref. 
PH as a model of index evolution in finance. Besides giving this proposal a rigorous basis, the results presented 
here, especially those concerning the restoration of ergodicity, substantially enhance the analytical and numerical 
tractability of such a process. 
VIII. APPENDIX



In the first part of this Appendix, we prove three different statements which in particular imply that $p_{Y_N}$, the PDF
of $Y_N = \sum_{i=1}^{N} X_i$, satisfies the scaling

$$ N^D\, p_{Y_N}(N^D y) \to g(y) \tag{19} $$

for $N \to \infty$ (refer to the main text for details).

Limit Theorem for g's given by Gaussian mixtures with different centers and D = 1/2 

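Given the sequence of joint PDF's

$$ p_N(x_1, x_2, \ldots, x_N) = \int_{-\infty}^{+\infty} d\mu\, \lambda(\mu) \prod_{i=1}^{N} l\!\left(x_i - \frac{\mu}{N^{1-D}}\right), \qquad N = 1, 2, \ldots \tag{20} $$

for the random variables $\{X_i\}_{i=1,2,\ldots,N}$, where $D = 1/2$, $\lambda$ and $l$ are single-variable PDF's with $\langle \mu \rangle_\lambda = 0$ and $\langle X \rangle_l = 0$,
$\langle X^2 \rangle_l = 1$, then as $N \to \infty$ the probability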

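$$ \mathrm{Prob}\left\{ \sum_{i=1}^{N} \frac{X_i}{N^D} < z \right\} \to \int_{-\infty}^{z} dw\, g(w) \tag{21} $$

uniformly, with

$$ g(w) = \int_{-\infty}^{+\infty} d\mu\, \lambda(\mu)\, \frac{\exp[-(w-\mu)^2/2]}{\sqrt{2\pi}}. \tag{22} $$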



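Let us consider the positive quantity

$$ \mathrm{Prob}_\mu\left\{ \sum_{i=1}^{N} \frac{X_i}{N^{1/2}} < z \right\} \equiv \int_{\sum_i x_i/N^{1/2} < z}\; \prod_{i=1}^{N} dx_i\; l\!\left(x_i - \frac{\mu}{N^{1/2}}\right), \tag{23} $$

which, once multiplied by $\lambda$ and integrated with respect to $\mu$, yields the probability that $\sum_i X_i/N^{1/2} < z$. The
following identity holds, since shifting every $x_i$ by $\mu/N^{1/2}$ shifts the rescaled sum by $\mu$:

$$ \mathrm{Prob}_\mu\left\{ \sum_{i=1}^{N} \frac{X_i}{N^{1/2}} < z \right\} = \mathrm{Prob}_0\left\{ \sum_{i=1}^{N} \frac{X_i}{N^{1/2}} < z - \mu \right\}. \tag{24} $$

The central limit theorem for independent variables guarantees [1, 2] that the right hand side of Eq. (24) converges
uniformly to

$$ \int_{-\infty}^{z-\mu} dw\, \frac{\exp(-w^2/2)}{\sqrt{2\pi}} = \int_{-\infty}^{z} dw\, \frac{\exp[-(w-\mu)^2/2]}{\sqrt{2\pi}}. \tag{25} $$

Since the uniform convergence holds for $z$ and $\mu$ separately, we can interchange the integration in $\mu$ with the limit for
$N \to \infty$ and get

$$ \int_{-\infty}^{+\infty} d\mu\, \lambda(\mu)\, \mathrm{Prob}_\mu\left\{ \sum_{i=1}^{N} \frac{X_i}{N^{1/2}} < z \right\} \to \int_{-\infty}^{z} dw \int_{-\infty}^{+\infty} d\mu\, \lambda(\mu)\, \frac{\exp[-(w-\mu)^2/2]}{\sqrt{2\pi}}, \tag{26} $$

still uniformly in $z$. This proves the asymptotic scaling (19) of $p_{Y_N}$, with $D = 1/2$ and $g$ as in Eq. (22).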
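
As a numerical illustration of the statement just proved, the following Python sketch samples the rescaled sum for $D = 1/2$ and compares its empirical distribution function with the integral of $g$ in Eq. (22). The specific choices are assumptions made only for this example and are not taken from the text: $\lambda$ is a two-point density with weight 1/2 on $\mu = \pm 2$ (so that $\langle \mu \rangle_\lambda = 0$), $l$ is uniform on $[-\sqrt{3}, \sqrt{3}]$ (zero mean, unit variance), and the function names are hypothetical.

```python
import math
import random

CENTERS = (-2.0, 2.0)  # hypothetical two-point lambda with equal weights


def rescaled_sum(n, rng):
    """Return sum_i X_i / N^(1/2) for one draw from Eq. (20) with D = 1/2."""
    mu = rng.choice(CENTERS)          # mu distributed according to lambda
    shift = mu / math.sqrt(n)         # mu / N^(1-D) with D = 1/2
    h = math.sqrt(3.0)                # uniform l on [-h, h] has unit variance
    total = sum(shift + rng.uniform(-h, h) for _ in range(n))
    return total / math.sqrt(n)


def limit_cdf(z):
    """Integral of g(w) in Eq. (22) up to z, for the two-point lambda."""
    def phi(u):
        return 0.5 * (1.0 + math.erf(u / math.sqrt(2.0)))
    return sum(phi(z - mu) for mu in CENTERS) / len(CENTERS)


def empirical_cdf(z, samples):
    """Fraction of samples strictly below z."""
    return sum(s < z for s in samples) / len(samples)


if __name__ == "__main__":
    rng = random.Random(0)
    samples = [rescaled_sum(200, rng) for _ in range(4000)]
    for z in (-2.0, 0.0, 2.0):
        print(z, round(empirical_cdf(z, samples), 3), round(limit_cdf(z), 3))
```

The two printed columns should agree within the statistical error of the sample, which is what the uniform convergence in Eq. (26) leads one to expect for moderately large $N$.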

Limit Theorem for g's given by Gaussian mixtures with different centers and D > 1/2 

Here, we establish a similar result with $D > 1/2$. Let us look back at Eq. (22), where we have a convex combination
of Gaussians with finite second moment equal to 1. Suppose we perform a limit in which this second moment is sent
to zero: In this limit the Gaussian would approach a Dirac delta-function. Hence, we would have

$$ g(x) = \lambda(x). \tag{27} $$

In order to construct $p_N$ such that $p_{Y_N}$ satisfies asymptotic scaling with $g = \lambda$ and with $D > 1/2$, it is convenient to
consider the characteristic functions of $p_N$ and of $g$, respectively:

$$ \tilde p_N(k_1, \ldots, k_N) = \int \prod_{i=1}^{N} dx_i\, \exp(-i k_i x_i)\; p_N(x_1, \ldots, x_N), \tag{28} $$

$$ \tilde g(k) = \int dw\, \exp(-i k w)\, g(w). \tag{29} $$

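We can prove the following statement.
Given the sequence of joint PDF's

$$ p_N(x_1, x_2, \ldots, x_N) = \int_{-\infty}^{+\infty} d\mu\, \lambda(\mu) \prod_{i=1}^{N} l\!\left(x_i - \frac{\mu}{N^{1-D}}\right), \qquad N = 1, 2, \ldots \tag{30} $$

for the random variables $\{X_i\}_{i=1,2,\ldots,N}$, where $D > 1/2$, $\lambda$ and $l$ are single-variable PDF's with $\langle \mu \rangle_\lambda = 0$ and $\langle X \rangle_l = 0$,
$\langle X^2 \rangle_l = 1$, then as $N \to \infty$ we have

$$ \tilde p_N\!\left(\frac{k}{N^D}, \ldots, \frac{k}{N^D}\right) \to \tilde g(k), \tag{31} $$

with

$$ g(w) = \lambda(w). \tag{32} $$

The convergence is uniform in $k$ if $|\tilde\lambda(k)|$ decays at large $|k|$ as $1/|k|^2$ or faster, uniform in $k$ in any bounded subset
of $\mathbb{R}$ otherwise.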

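Indeed, the characteristic function of such $p_N$ is

$$ \tilde p_N(k_1, \ldots, k_N) = \int_{-\infty}^{+\infty} d\mu\, \lambda(\mu)\, \exp\!\left[-i (k_1 + \cdots + k_N) \frac{\mu}{N^{1-D}}\right] \prod_{i=1}^{N} \tilde l(k_i), \tag{33} $$

where $\tilde l$ is the characteristic function of $l$. Setting $k_i = k/N^D$ for all $i$, and denoting by $\tilde\lambda$ the characteristic
function of $\lambda$, we can write

$$ \tilde p_N\!\left(\frac{k}{N^D}, \ldots, \frac{k}{N^D}\right) = \tilde\lambda(k) \left[\tilde l\!\left(\frac{k}{N^D}\right)\right]^N. \tag{34} $$

If we assume $D > 1/2$, $\tilde l(k/N^D)^N$ approaches 1 for $N \to \infty$, uniformly in $k$ in any bounded subset of $\mathbb{R}$. This implies
that as $N \to \infty$,

$$ \tilde p_N\!\left(\frac{k}{N^D}, \ldots, \frac{k}{N^D}\right) \to \tilde\lambda(k), \tag{35} $$

which proves the theorem. The convergence in Eq. (35) is uniform in $k$ if $|\tilde\lambda(k)|$ decays at large $|k|$ as $1/|k|^2$ or faster,
uniform in $k$ in any bounded subset of $\mathbb{R}$ otherwise.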
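
The key step of the proof, namely that $\tilde l(k/N^D)^N$ tends to 1 only when $D > 1/2$, can be checked in closed form for a particular $l$. The sketch below assumes a unit-variance Gaussian $l$ (our choice, for illustration only), for which $\tilde l(q) = \exp(-q^2/2)$ and therefore $\tilde l(k/N^D)^N = \exp(-k^2 N^{1-2D}/2)$. The function name is hypothetical.

```python
import math


def l_tilde_power(k, n, d):
    """[l~(k / N^D)]^N for a unit-variance Gaussian l, l~(q) = exp(-q^2 / 2)."""
    q = k / n ** d
    # exp(-q^2/2)^N is evaluated as exp(-N q^2 / 2) for numerical accuracy
    return math.exp(-0.5 * n * q * q)


if __name__ == "__main__":
    for d in (0.5, 0.75, 1.0):
        values = [round(l_tilde_power(2.0, n, d), 4) for n in (10, 10**3, 10**6)]
        print("D =", d, values)
```

For $D = 1/2$ the factor stays at $\exp(-k^2/2)$ for every $N$, which is the Gaussian width retained in Eq. (22), whereas for $D > 1/2$ it approaches 1 and only $\tilde\lambda(k)$ survives.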

Limit Theorem for g's given by Gaussian mixtures with different centers and widths, and D > 0

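Given the sequence of joint PDF's

$$ p_N(x_1, \ldots, x_N) = \int_{0}^{+\infty} d\sigma \int_{-\infty}^{+\infty} d\mu\, \psi(\sigma, \mu) \prod_{i=1}^{N} \frac{1}{\sigma N^{D-1/2}}\; l\!\left(\frac{x_i - \mu/N^{1-D}}{\sigma N^{D-1/2}}\right), \qquad N = 1, 2, \ldots \tag{36} $$

for the random variables $\{X_i\}_{i=1,2,\ldots,N}$, where $D > 0$; $\psi$ is a joint PDF identically equal to zero in a whole neighborhood
of $\sigma = 0$ and such that $\langle \mu \rangle_\psi = 0$, and $l$ is a single-variable PDF with $\langle X \rangle_l = 0$, $\langle X^2 \rangle_l = 1$, then as $N \to \infty$ the
probability

$$ \mathrm{Prob}\left\{ \sum_{i=1}^{N} \frac{X_i}{N^D} < z \right\} \to \int_{-\infty}^{z} dw\, g(w) \tag{37} $$

uniformly, with

$$ g(w) = \int_{0}^{+\infty} d\sigma \int_{-\infty}^{+\infty} d\mu\, \psi(\sigma, \mu)\, \frac{\exp[-(w-\mu)^2/2\sigma^2]}{\sqrt{2\pi}\,\sigma}. \tag{38} $$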



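For any $\sigma$ and $\mu$ we define the quantity

$$ \mathrm{Prob}_{\mu,\sigma}\left\{ \sum_{i=1}^{N} \frac{X_i}{N^D} < z \right\} \equiv \int_{\sum_i x_i/N^D < z}\; \prod_{i=1}^{N} \frac{dx_i}{\sigma N^{D-1/2}}\; l\!\left(\frac{x_i - \mu/N^{1-D}}{\sigma N^{D-1/2}}\right) = f(z, N, D, \mu, \sigma). \tag{39} $$

One easily verifies the following property:

$$ f(z, N, D, \mu, \sigma) = f\!\left(\frac{z - \mu}{\sigma}, N, 1/2, 0, 1\right). \tag{40} $$

Under the present assumptions, the central limit theorem for independent variables [1, 2] guarantees that the quantity
on the right hand side of Eq. (40) converges uniformly to the limit

$$ \int_{-\infty}^{(z-\mu)/\sigma} dw\, \frac{\exp(-w^2/2)}{\sqrt{2\pi}}. \tag{41} $$

In view of the conditions on $\psi$, this uniformity holds for $\sigma$, $\mu$ and $z$, separately. We thus conclude that

$$ \int_{0}^{+\infty} d\sigma \int_{-\infty}^{+\infty} d\mu\, \psi(\sigma, \mu)\, \mathrm{Prob}_{\mu,\sigma}\left\{ \sum_{i=1}^{N} \frac{X_i}{N^D} < z \right\} \to \int_{-\infty}^{z} dw \int_{0}^{+\infty} d\sigma \int_{-\infty}^{+\infty} d\mu\, \psi(\sigma, \mu)\, \frac{\exp[-(w-\mu)^2/2\sigma^2]}{\sqrt{2\pi}\,\sigma}, \tag{42} $$

uniformly in $z$. The uniformity follows from the hypothesis that $\psi$ is zero in a whole neighborhood of $\sigma = 0$.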

If $\psi(\sigma, \mu) = \rho(\sigma)\,\delta(\mu)$, i.e., with $g$ given by a mixture of Gaussians of different widths and all centered in the origin,
the $X$ variables are linearly uncorrelated for any $N$.
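
The change of variables behind Eq. (40) can be made concrete with a few lines of Python. The sketch assumes that, at fixed $(\sigma, \mu)$, each increment is $X_i = \mu/N^{1-D} + \sigma N^{D-1/2}\, \xi_i$ with independent $\xi_i$ distributed according to $l$, and takes $l$ to be a standard Gaussian (an assumption of this example only). It then checks that $\sum_i X_i/N^D = \mu + \sigma \sum_i \xi_i/N^{1/2}$ for any $D$. Function names are hypothetical.

```python
import math
import random


def increments(n, d, sigma, mu, rng):
    """Draw X_1..X_N at fixed (sigma, mu) from the product in Eq. (36).

    Assumption of this sketch: l is a standard Gaussian.
    Returns the increments and the underlying unit-variance draws xi_i.
    """
    scale = sigma * n ** (d - 0.5)    # width sigma * N^(D - 1/2)
    shift = mu / n ** (1.0 - d)       # center mu / N^(1 - D)
    xi = [rng.gauss(0.0, 1.0) for _ in range(n)]
    return [shift + scale * e for e in xi], xi


def rescaled_sum(values, d):
    """sum_i v_i / N^D with N = len(values)."""
    return sum(values) / len(values) ** d


if __name__ == "__main__":
    rng = random.Random(0)
    for d in (0.3, 0.5, 0.9):
        xs, xi = increments(1000, d, sigma=2.0, mu=-1.0, rng=rng)
        lhs = rescaled_sum(xs, d)
        rhs = -1.0 + 2.0 * rescaled_sum(xi, 0.5)
        print(d, lhs, rhs)
```

Since the two columns coincide up to rounding, the event $\sum_i X_i/N^D < z$ is the event $\sum_i \xi_i/N^{1/2} < (z-\mu)/\sigma$, which is the content of Eq. (40).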



Simulating a self-similar process with strongly correlated increments 

Here we discuss the issue of how to simulate the stochastic processes described in the main text. Let $X_i$ be the
increment relative to the time interval $[(i-1)\Delta t, i\Delta t]$ of the discrete-time process $Y_N = \sum_{i=1}^{N} X_i$, where $\Delta t$ is an
elementary time-step and $t = N \Delta t$ the elapsed time. For the sake of definiteness, we assume $\psi$ to be of the form

$$ \psi(\sigma, \mu) = \delta(\mu)\, \rho(\sigma), \tag{43} $$

with

$$ \rho(\sigma) = A\, \frac{6 b^3 \sigma^2}{\pi\, (b^6 + \sigma^6)} \tag{44} $$

for $\sigma \in (\sigma_{min}, \sigma_{max})$ ($0 < \sigma_{min} < \sigma_{max}$) and $\rho(\sigma) = 0$ elsewhere. The parameter $A$ is just a normalization constant
fixed such that $\int_{\sigma_{min}}^{\sigma_{max}} d\sigma\, \rho(\sigma) = 1$, whereas $b$ determines $\langle \sigma^2 \rangle_\rho$. With such a choice, the scaling function

$$ g(x) = \int_{\sigma_{min}}^{\sigma_{max}} d\sigma\, A\, \frac{6 b^3 \sigma^2}{\pi\, (b^6 + \sigma^6)}\, \frac{\exp[-x^2/2\sigma^2]}{\sqrt{2\pi\sigma^2}}, \tag{45} $$

is even. With a sufficiently large $\sigma_{max}$, we can mimic a fat-tail power-law decay for $g$ of the kind $g(x) \sim 1/|x|^4$ at
large arguments. We first address the situation in which the problem of breaking of ergodicity is well posed, i.e.,
when the increments $X_i$'s of the process are stationary, so that it makes sense to compare their ensemble and time
averages. Later, we will comment about the more general case. We thus fix $D = 1/2$. With the choice (43) for $\psi$,
this also implies that the $X_i$'s are exchangeable. Indeed, according to Eq. (16) of the main text, the joint PDF for
the increments of the process becomes

$$ p_N(x_1, \ldots, x_N) = \int_{\sigma_{min}}^{\sigma_{max}} d\sigma\, A\, \frac{6 b^3 \sigma^2}{\pi\, (b^6 + \sigma^6)}\, \frac{e^{-(x_1^2 + \cdots + x_N^2)/2\sigma^2}}{(2\pi\sigma^2)^{N/2}}, \tag{46} $$
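and a straightforward calculation yields

$$ C_{\alpha\beta}(i,j) \equiv \frac{\langle |X_i|^\alpha |X_j|^\beta \rangle_{p_N} - \langle |X_i|^\alpha \rangle_{p_1} \langle |X_j|^\beta \rangle_{p_1}}{\langle |X_i|^{\alpha+\beta} \rangle_{p_1} - \langle |X_i|^\alpha \rangle_{p_1} \langle |X_i|^\beta \rangle_{p_1}} \tag{47} $$

$$ \phantom{C_{\alpha\beta}(i,j)} = \frac{B_\alpha B_\beta \left[ \langle \sigma^{\alpha+\beta} \rangle_\rho - \langle \sigma^\alpha \rangle_\rho \langle \sigma^\beta \rangle_\rho \right]}{B_{\alpha+\beta} \langle \sigma^{\alpha+\beta} \rangle_\rho - B_\alpha B_\beta \langle \sigma^\alpha \rangle_\rho \langle \sigma^\beta \rangle_\rho} \tag{48} $$

for all $i \neq j$ ($i, j = 1, 2, \ldots, N$), with

$$ B_\alpha \equiv \int_{-\infty}^{+\infty} dx\, |x|^\alpha\, \frac{e^{-x^2/2}}{\sqrt{2\pi}}. \tag{49} $$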



Notice that when $\sigma_{max} \to \infty$, $\langle \sigma^{\alpha+\beta} \rangle_\rho$ is finite only for $\alpha + \beta < 3$. The strong correlations among the increments
are reflected by the fact that $C_{\alpha\beta}(i,j)$ is different from zero. On the other hand, $\langle X_i X_j \rangle_{p_N} = 0$ $\forall\, j \neq i$, and the
process is uncorrelated at linear level. Hence,

$$ C_{lin}(i,j) \equiv \frac{\langle X_i X_j \rangle_{p_N} - \langle X_i \rangle_{p_1} \langle X_j \rangle_{p_1}}{\langle X_i^2 \rangle_{p_1} - \langle X_i \rangle_{p_1} \langle X_i \rangle_{p_1}} \tag{50} $$

is equal to 1 for $j = i$, and zero otherwise.
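
For readers who wish to evaluate these quantities numerically, the following sketch may help. It assumes that $\rho(\sigma) = A\, 6 b^3 \sigma^2 / [\pi (b^6 + \sigma^6)]$ on $(\sigma_{min}, \sigma_{max})$, with the parameter values quoted in the caption of Fig. 1, and that the correlator for $i \neq j$ is $B_\alpha B_\beta [\langle \sigma^{\alpha+\beta} \rangle_\rho - \langle \sigma^\alpha \rangle_\rho \langle \sigma^\beta \rangle_\rho] / [B_{\alpha+\beta} \langle \sigma^{\alpha+\beta} \rangle_\rho - B_\alpha B_\beta \langle \sigma^\alpha \rangle_\rho \langle \sigma^\beta \rangle_\rho]$. Under this assumption the substitution $t = \arctan(\sigma^3/b^3)$ makes $\rho$ uniform in $t$, which gives $A$ in closed form, an inverse-CDF sampler, and a simple quadrature for the moments. All function names are hypothetical and the quadrature is a plain midpoint rule, not a method taken from the text.

```python
import math
import random

B, S_MIN, S_MAX = 1.0 / math.sqrt(2.0), 0.01, 10.0   # values from the Fig. 1 caption
T0 = math.atan((S_MIN / B) ** 3)
T1 = math.atan((S_MAX / B) ** 3)
A = math.pi / (2.0 * (T1 - T0))   # normalization constant of rho


def sample_sigma(rng):
    """Inverse-CDF draw from rho: t = atan(sigma^3 / b^3) is uniform on (T0, T1)."""
    t = T0 + (T1 - T0) * rng.random()
    return B * math.tan(t) ** (1.0 / 3.0)


def sigma_moment(a, steps=20000):
    """<sigma^a>_rho by the midpoint rule in the variable t."""
    h = (T1 - T0) / steps
    total = sum((B * math.tan(T0 + (j + 0.5) * h) ** (1.0 / 3.0)) ** a
                for j in range(steps))
    return total / steps


def b_coeff(a):
    """B_a: the a-th absolute moment of a standard Gaussian."""
    return 2.0 ** (a / 2.0) * math.gamma((a + 1.0) / 2.0) / math.sqrt(math.pi)


def c_theory(alpha, beta):
    """Ensemble correlator C_{alpha beta}(i, j) for i != j."""
    m_ab = sigma_moment(alpha + beta)
    m_a, m_b = sigma_moment(alpha), sigma_moment(beta)
    num = b_coeff(alpha) * b_coeff(beta) * (m_ab - m_a * m_b)
    den = b_coeff(alpha + beta) * m_ab - b_coeff(alpha) * b_coeff(beta) * m_a * m_b
    return num / den


if __name__ == "__main__":
    print("A =", A)
    print("<sigma^2> =", sigma_moment(2.0))
    print("C_11 (i != j) =", c_theory(1.0, 1.0))
```

With $b = 1/\sqrt{2}$ the second moment of $\rho$ over the full half-line equals 1, so the printed $\langle \sigma^2 \rangle_\rho$ is slightly below 1 because of the cutoff at $\sigma_{max}$.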

A natural strategy of simulation of the process is based on extracting the random increments $x_1, x_2, \ldots, x_i, \ldots$
according to the sequence of conditional PDF's

$$ p_1(x_1) = g(x_1), \quad p^c_2(x_2|x_1), \quad \ldots, \quad p^c_i(x_i|x_{i-1}, \ldots, x_1), \quad \ldots \tag{51} $$

respectively. To distinguish it from what follows, we call this simulation scheme "progressive". An ensemble of a large
number of independent simulations of this kind reproduces well all the theoretical features of the process. For instance,
in Fig. 1a we find that $C_{lin}(1,j)$ and $C_{11}(1,j)$ obtained from an ensemble of $10^5$ simulations oscillate around the
correct theoretical values. On the contrary, if we consider a single simulation of $N$ steps ($N \gg 1$) generated according
to Eq. (51), and the associated "sliding-window" correlators

$$ C_{\alpha\beta}(k) \equiv \frac{\frac{1}{N-k} \sum_{i=1}^{N-k} |x_i|^\alpha |x_{i+k}|^\beta - \left( \frac{1}{N-k} \sum_{i=1}^{N-k} |x_i|^\alpha \right) \left( \frac{1}{N-k} \sum_{i=1}^{N-k} |x_{i+k}|^\beta \right)}{\frac{1}{N} \sum_{i=1}^{N} |x_i|^{\alpha+\beta} - \left( \frac{1}{N} \sum_{i=1}^{N} |x_i|^\alpha \right) \left( \frac{1}{N} \sum_{i=1}^{N} |x_i|^\beta \right)}, \tag{52} $$

$$ C_{lin}(k) \equiv \frac{\frac{1}{N-k} \sum_{i=1}^{N-k} x_i x_{i+k} - \left( \frac{1}{N-k} \sum_{i=1}^{N-k} x_i \right) \left( \frac{1}{N-k} \sum_{i=1}^{N-k} x_{i+k} \right)}{\frac{1}{N} \sum_{i=1}^{N} x_i^2 - \left( \frac{1}{N} \sum_{i=1}^{N} x_i \right)^2}, \tag{53} $$

we find that both $C_{11}(k)$ and $C_{lin}(k)$ are zero for $k > 0$ (see Fig. 1b). This means that time-averages disagree with
ensemble-averages, i.e., the dynamics is not ergodic.

FIG. 1: Ergodicity breaking for a progressive simulation. Correlations calculated from an ensemble of $10^5$ realizations (a) and
from a single realization of $10^5$ steps (b). Here, and in the following, we use $\rho(\sigma)$ as in Eq. (44), with $\sigma_{min} = 0.01$, $\sigma_{max} = 10$,
and $b = 1/\sqrt{2}$.
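
The disagreement between ensemble and time averages can be reproduced with a short Python sketch. It relies on a shortcut that is ours rather than the text's: since the joint PDF in Eq. (46) is a mixture over $\sigma$ of products of identical Gaussians, drawing one $\sigma$ from $\rho$ and then independent Gaussian increments of width $\sigma$ yields the same joint statistics as a progressive history. The form $\rho(\sigma) \propto \sigma^2/(b^6 + \sigma^6)$ on $(\sigma_{min}, \sigma_{max})$ and all function names are assumptions of this example.

```python
import math
import random

B, S_MIN, S_MAX = 1.0 / math.sqrt(2.0), 0.01, 10.0   # values from the Fig. 1 caption
T0, T1 = math.atan((S_MIN / B) ** 3), math.atan((S_MAX / B) ** 3)


def sample_sigma(rng):
    """Inverse-CDF draw from rho, using t = atan(sigma^3 / b^3) uniform on (T0, T1)."""
    t = T0 + (T1 - T0) * rng.random()
    return B * math.tan(t) ** (1.0 / 3.0)


def mean(values):
    return sum(values) / len(values)


def ensemble_c11(n_real, rng):
    """Ensemble estimate of C_11(1, 2): every realization has its own sigma."""
    a, b = [], []
    for _ in range(n_real):
        s = sample_sigma(rng)
        a.append(abs(rng.gauss(0.0, s)))
        b.append(abs(rng.gauss(0.0, s)))
    num = mean([x * y for x, y in zip(a, b)]) - mean(a) * mean(b)
    den = mean([x * x for x in a]) - mean(a) ** 2
    return num / den


def sliding_c11(xs, k):
    """Sliding-window C_11(k) of Eq. (52) along a single history."""
    n = len(xs)
    a = [abs(x) for x in xs]
    head, tail = a[: n - k], a[k:]
    num = mean([x * y for x, y in zip(head, tail)]) - mean(head) * mean(tail)
    den = mean([x * x for x in a]) - mean(a) ** 2
    return num / den


if __name__ == "__main__":
    rng = random.Random(0)
    print("ensemble C_11(1,2):", ensemble_c11(40000, rng))
    sigma = sample_sigma(rng)                      # one ergodic component
    history = [rng.gauss(0.0, sigma) for _ in range(20000)]
    print("single-history C_11(1):", sliding_c11(history, 1))
```

The ensemble estimate stays clearly positive, while the sliding-window value along one history fluctuates around zero, in line with the behavior described for Fig. 1.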

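We gain an insight into this ergodicity breaking by noticing that the conditional PDF for the next increment at
each time-step $i$ can be expressed in the following way:

$$ p^c_i(x_i|x_{i-1}, \ldots, x_1) = \int_{0}^{+\infty} d\sigma\, \rho^c_i(\sigma|x_{i-1}, \ldots, x_1)\, \frac{e^{-x_i^2/2\sigma^2}}{\sqrt{2\pi\sigma^2}}, \tag{54} $$

where the conditional PDF for the value $\sigma$ is in fact a function depending only on $\sum_{j=1}^{i-1} x_j^2$:

$$ \rho^c_i(\sigma|x_{i-1}, \ldots, x_1) = \frac{\rho(\sigma)\, e^{-\sum_{j=1}^{i-1} x_j^2/2\sigma^2} / (2\pi\sigma^2)^{(i-1)/2}}{\int_{0}^{+\infty} d\sigma'\, \rho(\sigma')\, e^{-\sum_{j=1}^{i-1} x_j^2/2\sigma'^2} / (2\pi\sigma'^2)^{(i-1)/2}} \equiv f_i\!\left(\sigma, \sum_{j=1}^{i-1} x_j^2\right). \tag{55} $$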



As $i$ increases, very quickly $f_i$ becomes sharply peaked around a specific value $\bar\sigma$, which depends on the sum of the
squares of the past increments, $\sum_{j=1}^{i-1} x_j^2$. For a given $i \gg 1$, the dynamics is such that the typical growth of $\sum_{j=1}^{i} x_j^2$
with respect to $\sum_{j=1}^{i-1} x_j^2$ compensates for the functional change of $f_{i+1}$ with respect to $f_i$, and the new conditional
PDF, $\rho^c_{i+1}$, remains peaked around the same value $\bar\sigma$. In this way, a single ergodic component labeled by $\bar\sigma$ is chosen
during the initial stages of the simulation, when $\rho^c_i$ still resembles $\rho$. The subsequent dynamical evolution is then very
similar to a process with independent increments at the initially selected $\bar\sigma$. This is why along a single history of the
process the sliding-window analysis performed in Fig. 1b reveals a vanishing $C_{11}(k)$ for $k > 0$.
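
The sharpening of the conditional PDF for $\sigma$ can be made quantitative on a grid. The sketch below evaluates, for $n$ past increments with sum of squares $S$, weights proportional to $\rho(\sigma)\, \sigma^{-n} \exp(-S/2\sigma^2)$, which is the $\sigma$-dependence that Bayes' rule gives for a Gaussian mixture, and reports the mean and standard deviation of $\sigma$. The form of $\rho$, the log-spaced grid, the "typical" choice $S = n \bar\sigma^2$ with $\bar\sigma = 1$, and all names are assumptions of this illustration.

```python
import math

B, S_MIN, S_MAX = 1.0 / math.sqrt(2.0), 0.01, 10.0   # values from the Fig. 1 caption
N_GRID = 2000
STEP = math.log(S_MAX / S_MIN) / (N_GRID - 1)
GRID = [S_MIN * math.exp(j * STEP) for j in range(N_GRID)]   # log-spaced sigma grid


def log_rho(s):
    """log rho(sigma) up to the additive constant log(A)."""
    return math.log(6.0 * B ** 3 * s * s / (math.pi * (B ** 6 + s ** 6)))


def posterior(sum_sq, n):
    """Normalized grid weights of the conditional PDF of sigma given n past increments.

    Only n and the sum of squares of the past increments enter.
    """
    logw = [log_rho(s) + math.log(s)               # log(s): Jacobian of the log grid
            - n * math.log(s) - sum_sq / (2.0 * s * s) for s in GRID]
    top = max(logw)                                 # subtract the maximum to avoid overflow
    w = [math.exp(v - top) for v in logw]
    z = sum(w)
    return [v / z for v in w]


def mean_and_std(weights):
    m = sum(w * s for w, s in zip(weights, GRID))
    var = sum(w * (s - m) ** 2 for w, s in zip(weights, GRID))
    return m, math.sqrt(var)


if __name__ == "__main__":
    for n in (1, 10, 100, 1000):
        m, sd = mean_and_std(posterior(float(n), n))   # S = n * sigma_bar^2, sigma_bar = 1
        print(n, round(m, 4), round(sd, 4))
```

The standard deviation shrinks roughly as $\bar\sigma/\sqrt{2n}$, so after a few hundred steps the conditional PDF is effectively locked onto one value of $\sigma$.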

In practice, a progressive simulation scheme can be realized by first extracting a $\sigma$ according to the PDF in Eq. (55),
and then an $x_i$ from a Gaussian PDF with width $\sigma$. With a different, autoregressive, simulation strategy, scaling and
ergodic properties can be restored together within a good approximation up to a finite time-scale $M$. This is obtained
by considering a conditional PDF $p^{c,ar}_i$ which depends, still through Eq. (54), on the previous $M - 1$ increments only,
for all $i > M$:

$$ p^{c,ar}_i(x_i|x_{i-1}, \ldots, x_{i-M+1}) \equiv p^c_M(x_i|x_{i-1}, \ldots, x_{i-M+1}). \tag{56} $$



After the initial transient of $M$ time-steps, which is realized according to the progressive scheme in Eq. (51), using
$p^{c,ar}_i$ at each step $i > M$ we "forget" the increment $x_{i-M}$ and we thus fix to $M$ the dimension of the conditional
PDF for extracting the next increment of the process. This enables the conditional PDF $\rho^c_M$ to wander among all the
ergodic components labeled by the different $\sigma$'s, as is shown in Fig. 2a, where we recorded the histogram of the $\sigma$'s
in Eqs. (54) and (56) spanned by both a progressive and an autoregressive ($M = 100$) simulation of $10^5$ time-steps. While
in the progressive case the histogram is strongly peaked around a single $\sigma$, in the autoregressive one it reproduces well
the $\rho(\sigma)$ assumed in Eq. (44). We define the increment over an interval of duration $\tau = k \Delta t$ at time $t = i \Delta t$ as
$Z_{i,k} \equiv Y_{i+k} - Y_i$ ($i = 1, 2, \ldots, N-k$, $k > 0$), and then we sample the PDF $p_{Z_k}(z) \equiv \frac{1}{N-k} \sum_{i=1}^{N-k} p_{Z_{i,k}}(z)$ along
a single autoregressive history of $N$ steps. For an ergodic dynamics it is expected that the scaling properties of $p_{Z_k}$
reproduce those of $p_{Y_k}$. Fig. 2b shows that indeed the desired scaling properties for $p_{Z_k}$,

$$ k^D\, p_{Z_k}(k^D z) = g(z), \tag{57} $$

are well satisfied for $D = 1/2$ and $k < M$. The fidelity and the ergodicity of the autoregressive simulation are
furthermore supported by an inspection of $C_{\alpha\beta}(i,j)$ and $C_{\alpha\beta}(k)$, which reveals that both the ensemble and the time
correlations approximately coincide with the theoretical values as long as $j - i$ and $k$ are smaller than $M$, respectively
(Fig. 3a,b). For larger time separations, correlations slowly decay to zero, producing a smooth crossover to a process
with independent increments on scales much larger than $M$.

FIG. 2: (a) Histogram of the frequency of ergodic components $\sigma$ in $\rho^c_i$: Only for the autoregressive simulation the histogram
reproduces well $\rho(\sigma)$ in Eq. (44). (b) The rescaling of the histogram of the increments over an interval of duration $\tau = k \Delta t$
for a single autoregressive simulation of 10 time-steps with $M = 100$ reproduces $g(z)$ for $k < M$.
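
A compact way to experiment with the two schemes is sketched below in Python. At each step it draws $\sigma$ from a grid approximation of the conditional PDF of $\sigma$ given the retained past increments, then draws the next increment from a Gaussian of width $\sigma$. With `memory=M` only the last $M - 1$ increments are retained (autoregressive scheme), and with `memory=None` the whole past is kept (progressive scheme). The form $\rho(\sigma) \propto \sigma^2/(b^6 + \sigma^6)$, the coarse log-spaced grid, the small run lengths and all names are assumptions chosen to keep the example fast; they are not the settings used for the figures.

```python
import math
import random
from collections import deque

B, S_MIN, S_MAX = 1.0 / math.sqrt(2.0), 0.01, 10.0   # values from the Fig. 1 caption
N_GRID = 200
STEP = math.log(S_MAX / S_MIN) / (N_GRID - 1)
GRID = [S_MIN * math.exp(j * STEP) for j in range(N_GRID)]   # log-spaced sigma grid
LOG_S = [math.log(s) for s in GRID]
INV_2S2 = [1.0 / (2.0 * s * s) for s in GRID]
# log rho(sigma) up to a constant, plus log(sigma) for the Jacobian of the log grid
BASE = [math.log(s ** 3 / (B ** 6 + s ** 6)) for s in GRID]


def draw_sigma(sum_sq, n, rng):
    """Sample sigma given n retained increments with sum of squares sum_sq."""
    logw = [b - n * ls - sum_sq * inv for b, ls, inv in zip(BASE, LOG_S, INV_2S2)]
    top = max(logw)
    weights = [math.exp(v - top) for v in logw]
    return rng.choices(GRID, weights=weights)[0]


def simulate(n_steps, memory, rng):
    """Return (increments, sigmas); memory=M is autoregressive, None is progressive."""
    window = deque()          # squares of the retained past increments
    sum_sq = 0.0
    xs, sigmas = [], []
    for _ in range(n_steps):
        sigma = draw_sigma(sum_sq, len(window), rng)
        x = rng.gauss(0.0, sigma)
        xs.append(x)
        sigmas.append(sigma)
        window.append(x * x)
        sum_sq += x * x
        if memory is not None and len(window) > memory - 1:
            sum_sq -= window.popleft()     # "forget" the oldest increment
    return xs, sigmas


if __name__ == "__main__":
    rng = random.Random(0)
    _, sig_ar = simulate(2000, 50, rng)
    _, sig_pr = simulate(2000, None, rng)
    print("autoregressive sigma range:", min(sig_ar[1000:]), max(sig_ar[1000:]))
    print("progressive sigma range:   ", min(sig_pr[1000:]), max(sig_pr[1000:]))
```

In the progressive run the sampled $\sigma$'s settle on a narrow interval, whereas in the autoregressive run they keep moving; a faithful reproduction of the histograms would need a finer grid and much longer runs than this sketch uses.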

For simulations with $D \neq 1/2$, by considering the rescaled variables $X'_i \equiv X_i/a_i$, with $a_i \equiv [i^{2D} - (i-1)^{2D}]^{1/2}$
(see main text), the above discussion still applies. As a consequence, the mechanism of the selection of a specific
value $\sigma = \bar\sigma$ in $\rho^c_i$ for a single progressive simulation and that of the dynamical sampling of the various $\sigma$'s for a single
autoregressive one remain valid also when $D \neq 1/2$. These features are of crucial importance for the applicability of
such kind of processes in finance [23].
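
A quick consistency check on the rescaling factors, offered as our own illustration: with $a_i = [i^{2D} - (i-1)^{2D}]^{1/2}$ the squares telescope, $\sum_{i=1}^{N} a_i^2 = N^{2D}$. Hence, if the rescaled variables $X'_i$ are linearly uncorrelated with a common variance, as in the $D = 1/2$ process discussed above, the variance of $Y_N = \sum_i a_i X'_i$ grows as $N^{2D}$. The function name is hypothetical.

```python
import math


def increment_scales(n, d):
    """a_i = [i^(2D) - (i-1)^(2D)]^(1/2) for i = 1..N (requires D > 0)."""
    return [math.sqrt(i ** (2.0 * d) - (i - 1) ** (2.0 * d)) for i in range(1, n + 1)]


if __name__ == "__main__":
    for d in (0.3, 0.5, 0.8):
        a = increment_scales(1000, d)
        print(d, sum(x * x for x in a), 1000 ** (2.0 * d))
```

For $D = 1/2$ every $a_i$ equals 1 and the rescaling is trivial, while for $D > 1/2$ ($D < 1/2$) the factors grow (decrease) with $i$.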